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Attorney Docket 42P18108 

DYNAMIC ROUTE DISCOVERY FOR OPTICAL SWITCHED NETWORKS USING 

PEER ROUTING 



CROSS REFERENCE TO RELATED APPLICATIONS 

[0001] The present application is related to U.S. Patent Application No. 10/126,091, filed 
April 17, 2002; U.S. Patent Application No. 10/183,111, filed June 25, 2002; U. S. Patent 
Application No. 10/328,571, filed December 24, 2002; U.S. Patent Application No. 10/377,312 
filed February 28, 2003; U.S. Patent Application No. 10/377,580 filed February 28, 2003; U.S. 
Patent Application No. 10/417,823 filed April 16, 2003; U.S. Patent Application No. 10/417,487 
filed April 17, 2003; U.S. Patent Application No. (Attorney Docket No. 42P16183) filed May 19, 
2003, U.S. Patent Application No. (Attorney Docket No. 42P16552) filed June 18, 2003, U.S. 
Patent Application No. (Attorney Docket No. 42P16847) filed June 24, 2003, U.S. Patent 
Application No. (Attorney Docket No. 42P17373) filed August 6, 2003, and U.S. Patent 
Application No. (Attorney Docket No. 42P17541) filed October 22, 2003. 

FIELD OF THE INVENTION 

[0002] Embodiments of the present invention relate to optical networks in general; and, more 
specifically, to techniques for dynamic route discovery for optical-switched networks. 

BACKGROUND INFORMATION 

t0003] Transmission bandwidth demands in telecommunication networks (e.g., the Internet) 
appear to be ever increasing and solutions are being sought to support this bandwidth demand. 
One solution to this problem is to use fiber-optic networks, where wavelength-division- 
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multiplexing (WDM) technology enables the same physical link to transport multiple pieces of 
data concurrently. 

[0004] Conventional optical switched networks typically use wavelength routing techniques, 
which require that optical-electrical-optical (O-E-O) conversion of optical signals be done at the 
optical switches. O-E-O conversion at each switching node in the optical network is not only a 
very slow operation (typically about ten milliseconds), but it is very costly, and potentially 
creates a traffic bottleneck for the optical switched network. In addition, the current optical 
switch technologies cannot efficiently support "bursty" traffic that is often experienced in packet 
communication applications (e.g., the Internet). 

[0005] A large communication network can be implemented using several sub-networks. 
For example, a large network to support Internet traffic can be divided into a large number of 
relatively small access networks operated by Internet service providers (ISPs), which are coupled 
to a number of metropolitan area networks (Optical MANs), which are in turn coupled to a large 
"backbone" wide area network (WAN). The optical MANs and WANs typically require a higher 
bandwidth than local-area networks (LANs) in order to provide an adequate level of service 
demanded by their high-end users. Furthermore, as LAN speeds/bandwidth increase with 
improved technology, there is a corresponding need for increasing MAN/WAN 
speeds/bandwidth. 

[0006] Recently, optical burst switching (OBS) schemes have emerged as a promising 
solution to support high-speed bursty data traffic over WDM optical networks. The OBS scheme 
offers a practical opportunity between the current optical circuit-switching and the emerging all 
optical packet switching technologies. It has been shown that under certain conditions, the OBS 
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scheme achieves high-bandwidth utilization and class-of-service (CoS) by elimination of 
electronic bottlenecks as a result of the O-E-0 conversion occurring at switching nodes, and by 
using a one-way end-to-end bandwidth reservation scheme with variable time slot duration 
provisioning scheduled by the ingress nodes. Optical switching fabrics are attractive because 
they offer at least one or more orders of magnitude lower power consumption with a smaller 
form factor than comparable O-E-O switches. However, most of the recently published work on 
OBS networks focuses on the next-generation backbone data networks (i.e. Internet wide 
network) using high capacity (i.e., 1 Tb/s) WDM switch fabrics with a large number of 
input/output ports (i.e., 256x256), optical channels (i.e., 40 wavelengths), and requiring 
extensive buffering. Thus, these WDM switches tend to be complex and very expensive to 
manufacture. In contrast, there is a growing demand to support a wide variety of bandwidth- 
demanding applications such as storage area networks (SANs) and multimedia multicast at a low 
cost for both local and wide-area networks. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0007] Non-limiting and non-exhaustive embodiments of the present invention are described 
with reference to the following figures, wherein like reference numerals refer to like parts 
throughout the various views unless otherwise specified. 

[0008] Figure 1 is a simplified block diagram illustrating a photonic burst-switched (PBS) 
network with variable time slot provisioning, according to one embodiment of the present 
invention. 
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[0009] Figure 2 is a simplified flow diagram illustrating the operation of a photonic burst- 
switched (PBS) network, according to one embodiment of the present invention. 

[0010] Figure 3 is a block diagram illustrating a switching node module for use in a photonic 
burst-switched (PBS) network, according to one embodiment of the present invention. 

[0011] Figure 4 is a diagram illustrating a generalized multi-protocol label switching 
(GMPLS)-based architecture for a PBS network, according to one embodiment of the present 
invention. 

[0012] Figure 5 is a block diagram illustrating GMPLS-based PBS label format, according to 
one embodiment of the present invention. 

[0013] Figure 6 is a schematic diagram illustrating an exemplary set of GMPLS-based PBS 
labels employed in connection with routing data across a GMPLS-based PBS control network. 

[0014] Figure 7 is a block diagram illustrating message flows in connection with RSVP 
messages. 

[0015] Figures 8a, 8b, and 8c are data structures corresponding to an RSVP-TE-based Path 
message including extensions to support a coarse-grain resource reservation mechanism in 
accordance with one embodiment of the invention. 

[0016] Figure 9 is a data structure corresponding to a generalized PBS label request object of 
the Path message data structure of Figure 8a. 

[0017] Figures 10a, and 10b are data structures corresponding to an RSVP-TE-based Resv 
message including extensions to support the coarse-grain resource reservation mechanism in 
accordance with tme embodiment of the invention. 
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[0018] Figure 1 1 is a data structure corresponding to an RS VP-TE-based PathTear message 
including extensions to support tear down of resource reservations in accordance with one 
embodiment of the invention. 

[0019] Figure 12 is a data structure corresponding to an RS VP-TE-based ResvTear message 
including extensions to support tear down of resource reservations in accordance with one 
embodiment of the invention. 

[0020] Figure 13 is diagram illustrating a data structure corresponding to a sender descriptor 
object and a flow descriptor object that includes a field containing a bandwidth % value used to 
request reservation of resources supporting a % of the bandwidth provided by such resources. 

[0021] Figures 14a and 14b collectively comprises respective portions of a flowchart 
illustrating logic and operations performed by edge nodes and switching nodes during 
initialization and continuous operations of peer-to-peer routing, according to one embodiment of 
the present invention. 

[0022] Figure 15 is a schematic diagram that is analogous to the GMPLS-based PBS control 
network of Figure 6, wherein the switching network nodes have been replaced with routers, and 
including details of an exemplary routing tree generated at a switching node. 

[0023] Figure 16 is a schematic diagram corresponding to the diagram of Figure 15, further 
showing routing trees for edge nodes A and D. 

[0024] Figure 17 is a diagram illustrating routing tree table data including a portion of the 
possible lightpaths between nodes A and D of Figures, 15, and 16. 
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[0025] Figure 18 shows a portion of an exemplary LINK_AVAILABILITY table in which 
link availability data is stored, according to one embodiment of the invention. 

[0026] Figure 19 is a schematic diagram illustrating components of a Path message 
employed in an example lightpath reservation process corresponding to Figure 20. 

[0027] Figure 20 is a diagram illustrating an exemplary resource reservation table hosted by 
node B of Figures 15 and 16 and containing data used in connection with explaining the lightpath 
reservation process of Figures 14a and 14b. 

[0028] Figure 21 is a schematic diagram illustrating components of a Resv message 
employed in an example lightpath reservation process corresponding to Figures 14b. 

[0029] Figure 22 is a schematic diagram of a BGP router with co-located PBS label edge 
router node architecture, according to one embodiment of the invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

[0030] In the following detailed descriptions, embodiments of the invention are disclosed 
with reference to their use in a photonic burst-switched (PBS) network. A PBS network is a type 
of optical switched network, typically comprising a high-speed hop and span-constrained 
network, such as an enterprise network. The term "photonic burst" is used herein to refer to 
statistically-multiplexed packets (e.g., Internet protocol (IP) packets or Ethernet frames) having 
similar routing requirements. Although conceptually similar to backbone-based OBS networks, 
the design, operation, and performance requirements of these high-speed hop and span- 
constrained networks may be different. However, it will be understood that the teaching and 
principles disclosed herein may be applicable to other types of optical switched networks as well. 

[0031] Figure 1 illustrates an exemplary photonic burst-switched (PBS) network 10 in which 
embodiments of the invention described herein may be implemented. This embodiment of PBS 
network 10 includes local area networks (LANs) 13i-13 N and a backbone optical WAN (not 
shown). In addition, this embodiment of PBS network 10 includes ingress nodes 15i-15m, 
switching nodes 17i-17 L , and egress nodes 18i-18 K . PBS network 10 can include other ingress, 
egress and switching nodes (not shown) that are interconnected with the switching nodes shown 
in Figure 1. The ingress and egress nodes are also referred to herein as edge nodes in that they 
logically reside at the edge of the PBS network. The edge nodes, in effect, provide an interface 
between the aforementioned "external" networks (i.e., external to the PBS network) and the 
switching nodes of the PBS network. In this embodiment, the ingress, egress and switching 
nodes are implemented with intelligent modules. This embodiment can be used, for example, as 
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a metropolitan area network connecting a large number of LANs within the metropolitan area to 
a large optical backbone network. 

[0032] In some embodiments, the ingress nodes perform optical-electrical (O-E) conversion 
of received optical signals, and include electronic memory to buffer the received signals until 
they are sent to the appropriate LAN. In addition, in some embodiments, the ingress nodes also 
perform electrical-optical (E-O) conversion of the received electrical signals before they are 
transmitted to switching nodes 17i-17 M of PBS network 10. 

[0033] Egress nodes are implemented with optical switching units or modules that are 
configured to receive optical signals from other nodes of PBS network 10 and route them to the 
optical WAN or other external networks. Egress nodes can also receive optical signals from the 
optical WAN or other external network and send them to the appropriate node of PBS 
network 10. In one embodiment, egress node 18i performs O-E-O conversion of received optical 
signals, and includes electronic memory to buffer received signals until they are sent to the 
appropriate node of PBS network 10 (or to the optical WAN). 

[0034] Switching nodes 17i-17l are implemented with optical switching units or modules 
that are each configured to receive optical signals from other switching nodes and appropriately 
route the received optical signals to other switching nodes of PBS network 10. As is described 
below, the switching nodes perform O-E-O conversion of optical control bursts and network 
management control burst signals. In some embodiments, these optical control bursts and 
network management control bursts are propagated only on preselected wavelengths. The 
preselected wavelengths do not propagate optical "data" bursts (as opposed to control bursts and 
network management control bursts) signals in such embodiments, even though the control 
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bursts and network management control bursts may include necessary information for a 
particular group of optical data burst signals. The control and data information is transmitted on 
separate wavelengths in some embodiments (also referred to herein as out-of-band (OOB) 
signaling). In other embodiments, control and data information may be sent on the same 
wavelengths (also referred to herein as in-band (IB) signaling). In another embodiment, optical 
control bursts, network management control bursts, and optical data burst signals may be 
propagated on the same wavelength(s) using different encoding schemes such as different 
modulation formats, etc. In either approach, the optical control bursts and network management 
control bursts are sent asynchronously relative to its corresponding optical data burst signals. In 
still another embodiment, the optical control bursts and other control signals are propagated at 
different transmission rates as the optical data signals. 

[0035] Although switching nodes 17i-17 L may perform O-E-O conversion of the optical 
control signals, in this embodiment, the switching nodes do not perform O-E-O conversion of the 
optical data burst signals. Rather, switching nodes 17i-17 L perform purely optical switching of 
the optical data burst signals. Thus, the switching nodes can include electronic circuitry to store 
and process the incoming optical control bursts and network management control bursts that 
were converted to an electronic form and use this information to configure photonic burst switch 
settings, and to properly route the optical data burst signals corresponding to the optical control 
bursts. The new control bursts, which replace the previous control bursts based on the new 
routing information, are converted to an optical control signal, and it is transmitted to the next 
switching or egress nodes. Embodiments of the switching nodes are described further below. 
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[0036] Elements of exemplary PBS network 10 are interconnected as follows. 
LANs 13i-13n are connected to corresponding ones of ingress nodes 15i-15m. Within PBS 
network 10, ingress nodes 15i-15 M and egress nodes 18i-18 K are connected to some of switching 
nodes 17i-17l via optical fibers. Switching nodes 17i-17l are also interconnected to each other 
via optical fibers in mesh architecture to form a relatively large number of lightpaths or optical 
links between the ingress nodes, and between ingress nodes 15i-15 L and egress nodes 18i-18k. 
Ideally, there are more than one lightpath to connect the switching nodes 17i-17 L to each of the 
endpoints of PBS network 10 (i.e., the ingress nodes and egress nodes are endpoints within PBS 
network 10). Multiple lightpaths between switching nodes, ingress nodes, and egress nodes 
enable protection switching when one or more node fails, or can enable features such as primary 
and secondary route to destination. 

[0037] As described below in conjunction with Figure 2, the ingress, egress and switching 
nodes of PBS network 10 are configured to send and/or receive optical control bursts, optical 
data burst, and other control signals that are wavelength multiplexed so as to propagate the 
optical control bursts and control labels on pre-selected wavelength(s) and optical data burst or 
payloads on different preselected wavelength(s). Still further, the edge nodes of PBS network 10 
can send optical control burst signals while sending data out of PBS network 10 (either optical or 
electrical). 

[0038] Figure 2 illustrates the operational flow of PBS network 10, according to one 
embodiment of the present invention. Referring to Figures 1 and 2, photonic burst switching 
network 10 operates as follows. 



10 



Attorney Docket 42P18108 

[0039] The process begins in a block 20, wherein PBS network 10 receives packets from 
LANs 13i-13n. In one embodiment, PBS network 10 receives IP packets at ingress 
nodes 15i-15m. The received packets can be in electronic form rather than in optical form, or 
received in optical form and then converted to electronic form. In this embodiment, the ingress 
nodes store the received packets electronically. 

[0040] For clarity, the rest of the description of the operational flow of PBS network 10 
focuses on the transport of information from ingress node 15i to egress node 18i. The transport 
of information from ingress nodes 152- 15m to egress node 18i (or other egress nodes) is 
substantially similar. 

[0041] An optical burst label (i.e., an optical control burst) and optical payload (i.e., an 
optical data burst) is formed from the received packets, as depicted by a block 21. In one 
embodiment, ingress node 15 1 uses statistical multiplexing techniques to form the optical data 
burst from the received IP (Internet Protocol) packets stored in ingress node 15 1. For example, 
packets received by ingress node 15i and having to pass through egress node 18i on their paths 
to a destination can be assembled into an optical data burst payload. 

[0042] Next, in a block 22, Bandwidth on a specific optical channel and/or fiber is reserved 
to transport the optical data burst through PBS network 10. In one embodiment, ingress node 15i 
reserves a time slot (i.e., a time slot of a TDM system) in an optical data signal path through PBS 
network 10. This time slot maybe fixed-time duration and/or variable-time duration with either 
uniform or non-uniform timing gaps between adjacent time slots. Further, in one embodiment, 
the bandwidth is reserved for a time period sufficient to transport the optical burst from the 
ingress node to the egress node. For example, in some embodiments, the ingress, egress, and 
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switching nodes maintain an updated list of all used and available time slots. The time slots can 
be allocated and distributed over multiple wavelengths and optical fibers. Thus, a reserved time 
slot (also referred to herein as a TDM channel), which in different embodiments may be of fixed- 
duration or variable-duration, may be in one wavelength of one fiber, and/or can be spread across 
multiple wavelengths and multiple optical fibers. 

[0043] When an ingress and/or egress node reserves bandwidth or when bandwidth is 
released after an optical data burst is transported, a network controller (not shown) updates the 
list. In one embodiment, the network controller and the ingress or egress nodes perform this 
updating process using various burst or packet scheduling algorithms based on the available 
network resources and traffic patterns. The available variable-duration TDM channels, which 
are periodically broadcasted to all the ingress, switching, and egress nodes, are transmitted on the 
same wavelength as the optical control bursts or on a different common preselected wavelength 
throughout the optical network. The network controller function can reside in one of the ingress 
or egress nodes, or can be distributed across two or more ingress and/or egress nodes. 

[0044] The optical control bursts, network management control labels, and optical data 
bursts are then transported through photonic burst switching network 10 in the reserved time slot 
or TDM channel, as depicted by a block 23. In one embodiment, ingress node 15i transmits the 
control burst to the next node along the optical label-switched path (OLSP) determined by the 
network controller. In this embodiment, the network controller uses a constraint-based routing 
protocol [e.g., multi-protocol label switching (MPLS)] over one or more wavelengths to 
determine the best available OLSP to the egress node. 
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[0045] In one embodiment, the control label (also referred to herein as a control burst) is 
transmitted asynchronously ahead of the photonic data burst and on a different wavelength 
and/or different fiber. The time offset between the control burst and the data burst allows each of 
the switching nodes to process the label and configure the photonic burst switches to 
appropriately switch before the arrival of the corresponding data burst. The term photonic burst 
switch is used herein to refer to fast optical switches that do not use O-E-0 conversion. 

[0046] In one embodiment, ingress node 15i then asynchronously transmits the optical data 
bursts to the switching nodes where the optical data bursts experience little or no time delay and 
no O-E-0 conversion within each of the switching nodes. The optical control burst is always 
sent before the corresponding optical data burst is transmitted. 

[0047] In some embodiments, the switching node may perform O-E-O conversion of the 
control bursts so that the node can extract and process the routing information contained in the 
label. Further, in some embodiments, the TDM channel is propagated in the same wavelengths 
that are used for propagating labels. Alternatively, the labels and payloads can be modulated on 
the same wavelength in the same optical fiber using different modulation formats. For example, 
optical labels can be transmitted using non-return-to-zero (NRZ) modulation format, while 
optical payloads are transmitted using return-to-zero (RZ) modulation format. The optical burst 
is transmitted from one switching node to another switching node in a similar manner until the 
optical control and data bursts are terminated at egress node 18i< 

[0048] The remaining set of operations pertains to egress node operations. Upon receiving 
the data burst, the egress node disassembles it to extract the IP packets or Ethernet frames in a 
block 24. In one embodiment, egress node 18i converts the optical data burst to electronic 
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signals that egress node 18j can process to recover the data segment of each of the packets. The 
operational flow at this point depends on whether the target network is an optical WAN or a 
LAN, as depicted by a decision block 25. 

[0049] If the target network is an optical WAN, new optical label and payload signals are 
formed in a block 26. In this embodiment, egress node 18i prepares the new optical label and 
payload signals. The new optical label and payload are then transmitted to the target network 
(i.e., WAN in this case) in a block 27. In this embodiment, egress node 18i includes an optical 
interface to transmit the optical label and payload to the optical WAN. 

[0050] However, if in block 25 the target network is determined to be a LAN, the logic 
proceeds to a block 28. Accordingly, the extracted IP data packets or Ethernet frames are 
processed, combined with the corresponding IP labels, and then routed to the target network (i.e., 
LAN in this case). In this embodiment, egress node 18i forms these new IP packets. The new IP 
packets are then transmitted to the target network (i.e., LAN) as shown in block 29. 

[0051] PBS network 10 can achieve increased bandwidth efficiency through the additional 
flexibility afforded by the TDM channels. Although this exemplary embodiment described 
above includes an optical MAN having ingress, switching and egress nodes to couple multiple 
LANs to an optical WAN backbone, in other embodiments the networks do not have to be 
LANs, optical MANs or WAN backbones. That is, PBS network 10 may include a number of 
relatively small networks that are coupled to a relatively larger network that in turn is coupled to 
a backbone network, 

[0052] Figure 3 illustrates a module 17 for use as a switching node in photonic burst 
switching network 10 (Figure 1), according to one embodiment of the present invention. In this 
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embodiment, module 17 includes a set of optical wavelength division demultiplexers 30i-30 A , 
where A represents the number of input optical fibers used for propagating payloads, labels, and 
other network resources to the module. For example, in this embodiment, each input fiber could 
carry a set of C wavelengths (i.e., WDM wavelengths), although in other embodiments the input 
optical fibers may carry differing numbers of wavelengths. Module 17 would also include a set 
of N x N photonic burst switches 32i-32b, where TV is the number of input/output ports of each 
photonic burst switch. Thus, in this embodiment, the maximum number of wavelengths at each 
photonic burst switch is AC, where 7V>AC+1. For embodiments in which N is greater than 
AC, the extra input/output ports can be used to loop back an optical signal for buffering. 

[0053] Further, although photonic burst switches 32i-32 B are shown as separate units, they 
can be implemented asiViiV photonic burst switches using any suitable switch architecture. 
Module 17 also includes a set of optical wavelength division multiplexers 34i-34 A , a set of 
optical-to-electrical signal converters 36 (e.g., photo-detectors), a control unit 37, and a set of 
electrical-to-optical signal converters 38 (e.g., lasers). Control unit 37 may have one or more 
processors to execute software or firmware programs. Further details of control unit 37 are 
described below. 

[0054] The elements of this embodiment of module 17 are interconnected as follows. 
Optical demultiplexers 30i-30 A are connected to a set of A input optical fibers that propagate 
input optical signals from other switching nodes of photonic burst switching network 10 
(Figure 10). The output leads of the optical demultiplexers are connected to the set of B core 
optical switches 32i-32b and to optical signal converter 36. For example, optical 
demultiplexer 30i has B output leads connected to input leads of the photonic burst 
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switches 32 r 32 B (i.e., one output lead of optical demultiplexer 30 1 to one input lead of each 
photonic burst switch) and at least one output lead connected to optical signal converter 36. 

[0055] The output leads of photonic burst switches 32i-32 B are connected to optical 
multiplexers 34i-34 A . For example, photonic burst switch 32i has A output leads connected to 
input leads of optical multiplexers 34i-34 A {i.e., one output lead of photonic burst switch 32i to 
one input lead of each optical multiplexer). Each optical multiplexer also an input lead 
connected to an output lead of electrical-to-optical signal converter 38. Control unit 37 has an 
input lead or port connected to the output lead or port of optical-to-electrical signal converter 36. 
The output leads of control unit 37 are connected to the control leads of photonic burst 
switches 32j-32b and electrical-to-optical signal converter 38. 

[0056] In accordance with further aspects of the invention, a dynamically-routed OLSP 
scheduling mechanism employing signaling extensions to a GMPLS-based framework for a PBS 
network is provided. An overview of a GMPLS-based control scheme for a PBS network in 
which the signaling extensions may be implemented in accordance with one embodiment is 
illustrated in Figure 4. Starting with the GMPLS suite of protocols, each of the GMPLS 
protocols can be modified or extended to support PBS operations and optical interfaces while 
still incorporating the GMPLS protocols' various traffic-engineering tasks. The integrated PBS 
layer architecture include PBS data services layer 400 on top of a PBS MAC layer 401, which is 
on top of a PBS photonics layer 402. It is well known that the GMPLS-based protocols suite 
(indicated by a block 403 in Figure 4) includes a provisioning component 404, a signaling 
component 405, a routing component 406, a label management component 407, a link 
management component 408, and a protection and restoration component 409. In some 
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embodiments, these components are modified or have added extensions that support the PBS 
layers 400-402. Further, in this embodiment, GMPLS-based suite 403 is also extended to 
include an operation, administration, management and provisioning (OAM&P) component 410. 
Further information on GMPLS architecture can be found at http://www.ietf.org/internet- 
drafts/draft-ietf-ccamp-gmpls-architecture-07.txt . In addition, a functional description of basic 
GMPLS signaling can be found at http://www.ietf.org/rfc/rfc347 1 .txt . 

[0057] In accordance with one aspect of the invention, signaling component 405 can include 
extensions specific to PBS networks such as, for example, burst start time, burst type, burst 
length, and burst priority, etc. As described in further detail below, GMPLS signaling extensions 
are disclosed for enabling reservation scheduling using the RSVP-TE (ReSerVation Protocol - 
Traffic Engineering) protocol. Link management component 408 can be implemented based on 
the well-known link management protocol (LMP) (that currently supports only SONET/SDH 
networks), with extensions added to support PBS networks. Protection and restoration 
component 409 can, for example, be modified to cover PBS networks. Further information on 
LMP can be found at http://www.ietf.org/internet-drafts/draft-ietf-ccamp-lmp-09.txt . 

[0058] Label management component 407 can be modified to support a PBS control channel 
label space as well. In one embodiment, the label operations are performed after control channel 
signals are O-E converted. The ingress nodes of the PBS network act as label edge routers 
(LERs) while the switching nodes act as label switch routers (LSRs). An egress node acts 
similarly to an egress LER, continuously providing labels for the PBS network. An ingress node 
can propose a label to be used on the lightpath segment it is connected to, but the downstream 
switching node will be the node that ultimately selects a label value, potentially rejecting the 
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proposed label and selecting its own label. In general, a label list can also be proposed by an 
edge or switching node to its downstream switching node. This component can advantageously 
increase the speed of control channel context retrieval (by performing a pre-established label 
look-up instead of having to recover a full context). Further details of label configuration and 
usage are discussed in co-pending U.S. Patent Application No. (Attorney Docket No. 42P16847). 

[0059] To enable PBS networking within hop and span-constrained networks, such as 
enterprise networks and the like, it is advantageous to extend the GMPLS-based protocols suite 
to recognize the PBS optical interfaces at both ingress/egress nodes and switching nodes. Under 
the GMPLS-based framework, the PBS MAC layer is tailored to perform the different PBS 
operations while still incorporating the MPLS-based traffic engineering features and functions 
for control burst switching of coarse-grain (from seconds to days or longer) optical flows 
established using a reservation protocol and represented by a PBS label. 

[0060] In important aspect of the present invention pertains to label signaling, whereby 
coarse-grain lightpaths are signaled end-to-end and assigned a unique PBS label. The PBS label 
has only lightpath segment significance and not end-to-end significance. In exemplary PBS label 
format 500 is shown in Figure 5 with its corresponding fields, further details of which are 
discussed below. The signaling of PBS labels for lightpath set-up, tear down, and maintenance is 
done through an extension of IETF (Internet Engineering Task Force) Resource Reservation 
Protocol-Traffic Engineering (RSVP-TE). More information on GMPLS signaling with RSVP- 
TE extensions can be found at http://www.ietf.org/rf/rfc3473.txt . 

[0061] The PBS label, which identifies the data burst input fiber, wavelength, and lightpath 
segment, optical channel spacing, is used on the control path to enable one to make soft 
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reservation request of the network resources (through corresponding Resv messages). If the 
request is fulfilled (through the Path message), each switching node along the selected lightpath 
commits the requested resources, and the lightpath is established with the appropriate segment- 
to-segment labels. Each switching node is responsible for updating the initial PBS label through 
the signaling mechanism, indicating to the previous switching node the label for its lightpath 
segment. If the request cannot be fulfilled or an error occurred, a message describing the 
condition is sent back to the originator to take the appropriate action {i.e., select another lightpath 
characteristics). Thus, the implementation of the PBS label through signaling enables an 
efficient MPLS type lookup for the control burst processing. This processing improvement of 
the control burst at each switching node reduces the required offset time between the control and 
data bursts, resulting in an improved PBS network throughput and reduced end-to-end latency. 

[0062] In addition to the software blocks executed by the PBS control processor, there are 
several other key components that support PBS networking operations described herein. Link 
Management component 408 is responsible for providing PBS network transport link status 
information such as link up/down, loss of light, etc. The component runs its own link 
management protocol on the control channel. In one embodiment, the IETF link management 
protocol (LMP) protocol is extended to support PBS interfaces. Link protection and restoration 
component 409 is responsible for computing alternate optical paths among the various switching 
nodes based on various user-defined criteria when a link failure is reported by the link 
management component. OAM&P component 410 is responsible for performing various 
administrative tasks such as device provisioning. 
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[0063] Additionally, routing component 406 provides routing information to establish the 
route for control and data burst paths to their final destination. For PBS networks with bufferless 
switch fabrics, this component also plays an important role in making PBS a more reliable 
transport network by providing backup route information that is used to reduce contention. 

[0064] The label signaling scheme of the present invention reduces the PBS offset time by 
reducing the amount of time it takes to process a signaled lightpath. This is achieved by 
extending the GMPLS-based framework to identify each lightpath segment within the PBS 
network using a unique label defined in a PBS label space. The use of a PBS label speeds up the 
PBS control burst processing by allowing the control interface unit within the PBS switching 
node, which processes the control burst, to lookup relevant physical routing information and 
other relevant processing state based on the label information used to perform a fast and efficient 
lookup. Thus, each PBS switching node has access in one lookup operation to the following 
relevant information, among others: 1) the address of the next hop to send the control burst to; 2) 
information about the outgoing fiber and wavelength; 3) label to use on the next segment if 
working in a label-based mode; and 4) data needed to update the scheduling requirement for the 
specific input port and wavelength. 

[0065] Returning to Figure 5, in one embodiment PBS label 500 comprises five fields, 
including an input fiber port field 502, an input wavelength field 504, a lightpath segment ID 
field 506, an optical channel spacing (A) field 508, and a reserved field 510. The input fiber port 
field 502 comprises an 8-bit field that specifies the input fiber port of the data channel identified 
by the label (which itself is carried on the control wavelength. The input wavelength field 504 
comprises a 32-bit field that describes the input data wavelength used on the input fiber port 
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specified by input fiber port field 502, and is described in further detail below. The lightpath 
segment ID field 506 comprises a 16-bit field that describes the lightpath segment ID on a 
specific wavelength and a fiber cable. Lightpath segment ID's are predefined values that are 
determined based on the PBS network topology, The channel spacing field 508 comprises a 4- 
bit field used for identifying the channel spacing (i.e., separation between adjacent optical 
channels) in connection with the A variable defined below. The reserved field 510 is reserved 
for implementation-specific purposes and future expansion. 

[0066] In one embodiment, the input wavelength is represented using IEEE (Institute of 
Electrical and Electronic Engineers) standard 754 for single precision floating-point format. The 
32-bit word is divided into a 1-bit sign indicator 5, an 8-bit biased exponent e, and a 23-bit 
fraction. The relationship between this format and the representation of real numbers is given 
by: 

(-if \2 e ~ ul ) ■ (1 + /) normalized ,0 < e < 255 

Value = i (-if -(2 e ~ 126 )■(<) + /) denormalized,e = 0,/ > 0 
exceptional value otherwise 

Eq.(1) 

[0067] One of the optical channels in the C band has a frequency of 197.200 THz, 
corresponding to a wavelength of 1520.25 nm. This channel is represented by setting s = 0, 
e = 134, and/= 0.540625. The adjacent channel separation can be 50 GHz, 100 GHz, 200 GHz, 
or other spacing. For 50 GHz channel separation, it can be written as: A = 0.05 = 1.6-2~ 5 (s = 0, e 
= 122,/= 0.6). Thus, the frequency of the nth channel is given by: 
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/(") = /(!) -(»-!)• A Eq.(2) 

[0068] Thus, according to equation (2), the optical channel frequency is given by n and the 
specific value of A, which can be provided as part of the initial network set-up. For example, 
using the standard ITU-T (International Telecommunications Union) grid C and L bands, n is 
limited to 249, corresponding to an optical frequency of 184.800 THz. However, other optical 
channel frequencies outside the above-mentioned range or other wavelength ranges such as 
wavelength band around 1310 nm can be also defined using equation (2). 

[0069] Operation of how PBS label 500 is implemented in a GMPLS-based PBS 
network 6500 is illustrated in Figure 6. Network 600, which may comprise one of various types 
of networks, such as an enterprise network, contains four PBS switching nodes, labeled B, C, E, 
and F, and two edge nodes labeled A and D. Network 600 is coupled at one end to a LAN or 
WAN network 602 and a LAN or WAN network 604 at another end, wherein nodes A and D 
operate as edge nodes. For the following example, it is desired to route traffic from network 602 
to network 604. Accordingly, edge node A (i.e., the source node) operates as an ingress node, 
while edge node D (i.e., the destination node) operates as an egress node. 

[0070] The various switching nodes B, C, E, and F are coupled by lightpath segments LP1- 
LP13, as shown in Figure 6. A lightpath segment comprises an optical connection via optical 
fibers between any adjacent nodes. A lightpath comprises the optical path traveled between 
source and destination nodes, and typically will comprises a concatenation of a plurality of 
lightpath segments. In the illustrated example discussed below, one of the lightpaths between the 
source node (ingress node A) and the destination node (egress node D) comprises lightpath 
segments LP1, LP4, and LP6. 
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[0071] As further shown in Figure 6, exemplary PBS labels A-B-0 and A-B-l are assigned to 
the path between nodes A and B at times t 0 and t\, respectively; labels B-C-0 and B-C-l are 
assigned to the path between nodes B and C nodes at times t 0 and t\\ and labels C-D-0 and C-D-l 
are assigned to the path between nodes C and D nodes at times t 0 and t\. For the purpose of 
simplicity, the lightpath segment ID's for lightpath segments LP1, LP2, LP3, LP4, LP5 and LP6 
are respectively defined as 0x0001, 0x0002, 0x0003, 0x0004, 0x0005, and 0x0006. In 
accordance with foregoing aspects of PBS networks, a particular LSP may comprise lightpath 
segments employing a single wavelength, or different wavelengths. For example, in the 
illustrated example, label A-B-0 defines the use of an optical frequency of 197.2 THz 
(0x08683FDl), label B-C-0 defines the use of a frequency of 196.4 THz (0x08682767), and 
label C-D-0 defines the use of a frequency of 195.6 THz (0x08680EFD). On the way from A to 
D the signaling packet requests resource reservation on a lightpath segment-by-segment basis 
{i.e. LP1, LP4, and LP6). For example, edge node A requests resources to create a coarse-grain 
reservation of a selected lightpath. On the first lightpath segment, switching node B checks if it 
has sufficient resources to satisfy the request. If it doesn't have the resources, it sends an error 
message back to the originator of the request to take the appropriate action such as send another 
request or select another lightpath. If it has enough resources, it makes a soft reservation of these 
resources, and forwards it to the next switching node, wherein the operations are repeated until 
the destination node D is reached. When node D receives the soft reservation request, it checks 
if it can be fulfilled. 

[0072] To support reservation signaling, a signaling mechanism is implemented that employs 
extensions to the Resource reSerVation Protocol - Traffic Engineering (RSVP-TE). In general, 
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the RSVP-TE protocol is itself an extension of the RSVP protocol, as specified in IETF RFC 
2205. RSVP was designed to enable the senders, receivers, and routers of communication 
sessions (either multicast or unicast) to communicate with each other in order to set up the 
necessary router state to support various IP-based communication services. RSVP identifies a 
communication session by the combination of destination address, transport-layer protocol type, 
and destination port number. RSVP is not a routing protocol, but rather is merely used to reserve 
resources along an underlying route, which under conventional practices is selected by a routing 
protocol. 

[0073] Figure 7 shows an example of RSVP for a multicast session involving one traffic 
sender SI, and three traffic receivers, RCV1, RCV2, and RCV3. The diagram in Figure 7 is 
illustrative of the general RSVP operations, which may apply to unicast sessions as well. 
Upstream messages 700 and downstream messages 702 sent between sender SI and receivers 
RCV1, RCV2, and RCV3 are routed via routing components (e.g., switching nodes) Rl, R2, R3, 
and R4. The primary messages used by RSVP are the Path message, which originates from the 
traffic sender, and the Resv message, which originates from the traffic receivers. The primary 
roles of the Path message are first to install reverse routing state in each router along the path, 
and second to provided receivers with information about the characteristics of the sender traffic 
and end-to-end path so that they can make appropriate reservation requests. The primary role of 
the Resv message is to carry reservation requests to the routers along the distribution tree 
between receivers and senders. 

[0074] Connection creations requests are issued via a Path message. Details of a Path 
message 800 with signaling extensions in accordance with an embodiment of the invention is 
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shown in Figures 8a-c. For clarity, Path message 800 only shows fields that are pertinent to 
reservation signaling mechanism described herein; it will be understood that the Path message 
may further include additional fields specified by the RSVP-TE protocol. Also for clarity, fields 
that are augmented or added to the standard RSVP-TE data structures are shown in bold. 
Finally, objects contained in square brackets ([...]) are optional. 

[0075] The illustrated objects of Path message 800 include a Common Header 802, an 
optional Integrity object 804, a Session object 806, an RSVPJiop object 808, a Time_Values 
object 810, an optional Explicit_Route object 811, a generalized PBS_Label_Request object 812, 
an optional Label_Set object 814, an optional Admin_Status object 816, a 
Destination_PBS_address object 818, a Source_PBS_ Address object 820, an optional 
Policy_Data object 822, and a sender descriptor object 824. 

[0076] The optional Integrity object 804 carries cryptographic data to authenticate the 
originating node and to verify the contents of the RSVP message. The Session object 806 
contains the IP destination address (Dest Address), the IP protocol ID, and some form of 
generalized destination port, to define a specific session for the other objects to follow. In one 
embodiment, the information identifying an Internet Protocol version 4 (IPv4) session, is stored 
in Session object 806. Optionally, Internet Protocol version 6 (IPv6) may be employed. 

[0077] RSVP_Hop object 808 carries the IP address of the RSVP-capable node that sent the 
message (the most recent in the chain of nodes) and a logical outgoing interface handle LIH. 
RSVPJHop objects for downstream messages are known as PHOP ("previous hop") objects, 
while upstream RSVP_Hop objects are known as NHOP ("next hop") objects. Thus PHOP 
RSVP_Hop objects are labeled 808P, while NHOP RSVP_Hop objects are labeled 808N herein. 
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[0078] Under conventional practice, TimeJValues object 810 would contain the value for the 
refresh period used by the creator of the message. However, in accordance with principles of the 
invention, the object is used to store time values specifying the start and end of an OLSP 
reservation. 

[0079] The signaling protocol also supports explicit routing. This is accomplished via the 
explicit route object 811. This object encapsulates a concatenation of hops that constitute the 
explicitly routed path. Using the object, the paths taken by label-switched RSVP-MPLS flows 
can be pre-determined, independent of conventional IP routing. The explicitly routed path can 
be administratively specified, or automatically compute by a suitable entity based on QoS 
(Quality of Service) and policy requirements, taking into consideration the prevailing network 
state. In general, path computation can be control-driven or data-driven. 

[0080] Details of a generalized PBS_Label_Request object 812 format in accordance with 
one embodiment are shown in Figure 9. The object's format includes a length field 900, a Class- 
Num field 902, a C-Type field 904, and object contents 906. The values in both Class-Num 
field 902 and C-Type field 904 are constants that are standardized once a protocol goes through 
the standard track. In one embodiment, object contents 906 include a PBS label having a format 
shown in Figure 5 and described above. 

[0081] The LabeLSet object 814 is used to limit the label choices of a downstream node to a 
set of acceptable labels. This limitation applies on a per hop basis. RFC 3271 discusses four 
cases where a label set is useful in the optical domain. The first case is where the end equipment 
is only capable of transmitting on a small specific set of wavelengths/bands. The second case is 
where there is a sequence of interfaces that cannot support wavelength conversion (Cl-incapable) 
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and require the same wavelength be used end-to-end over a sequence of hops, or even an entire 
path. The third case is where it is desirable to limit the amount of wavelength conversion being 
performed to reduce the distortion on the optical signals. The last case is where two ends of a 
link support different sets of wavelengths. 

[0082] The LabeLSet object 814 is used to restrict label ranges that may be used for a 
particular LSP between two peers. The receiver of a LabeLSet must restrict its choice of labels 
to one which are specified in the Label_Set 814. Much like a label, a Label_Set 814 may be 
present across multiple hops. In this case each node generates its own outgoing Label_Set, 
possibly based on the incoming Label_Set and the node's hardware capabilities. This case is 
expected to be the norm for nodes with conversion-incapable (Cl-incapable) interfaces. The use 
of the LabeLSet 814 is optional; if not present, all labels from the valid label range may be used. 
Conceptually the absence of a specific Label_Set object implies a LabeLSet object whose value 
is {U}, the set of all valid labels. 

[0083] The Admin_Status object 816 is used to notify each node along the path of the status 
of an LSP. Status information is processed by each node based on local policy and the 
propagated in the corresponding outgoing messages. The object may be inserted in either Path 
or Resv messages at the discretion of the ingress (for Path messages) or egress (for Resv 
messages) nodes. 

[0084] The Destination_PBS_Address object 818 contains the IP address of the destination 
node (i.e., the egress node). As discussed above, this information may be provided in the session 
object; for clarity it is shown as separate data in Figure 8a. Similarly, the Source_PBS_Address 
object 820 contains the IP address of the source node (i.e., the ingress node). 
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[0085] Further details of sender descriptor 824 for unidirectional and bi-directional PBS light 
paths are respectively shown in Figures 8a and 8b. Figure 8a shows a unidirectional sender 
descriptor 824A that includes a sender template object 826. and a PBS_Sender_TSpec object 828. 
The bi-directional sender descriptor 824B further includes an upstream label 830 in addition to a 
sender template object 826 and a PBS_Sender_TSpec object 828. 

[0086] Figures 10a and 10b illustrate the various objects of a Resv message 1000 in 
accordance with one embodiment. As with conventional RSVP practice, a Resv message is 
issued by a receiving node in response to a Path message. Accordingly, Resv message 1000 
shares many object with Path message 800, including a common header 802, Integrity 
object 804, Session object 806, RSVP_Hop object 808, Time_Values object 810, Admin_Status 
object 816, and Policy_Data object 822. In addition, Resv message 1000 a reservation 
configuration object 1004, a Style object 1006, and a flow descriptor object 1008. 

[0087] Reservation confirmation object (Resv_Confirm) 1004 holds data that is used to 
confirm a reservation for a corresponding PBS resource. Further details of resource reservations 
are described below. Style object 1006 contains data identifying the reservation style, i.e., FF 
(Fixed Filter - distinct reservation and explicit sender selection), SE (Shared Explicit - shared 
reservation and explicit sender selection), and WF (Wildcard Filter - shared reservation and 
wildcard sender selection). 

[0088] Flow descriptor 1008 contains objects for describing data flows. These objects 
include a PBSJFlowspec 1010, a Filter_Spec 1012, and a Generalized_PBS_Label 1014. 

[0089] A PathTear message 1 100 employed to request the deletion of a connection is shown 
in Figure 11. The PathTear message 1100 includes objects that are corollary with Path 
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message 800. These objects include a Common Header 802, an optional Integrity object 804, a 
Session object 806, an RSVP_Hop object 808, and optional Admin_Status 816, and a sender 
descriptor 824. 

[0090] A ResvTear message 1200 issued in response to a PathTear message 1100 is shown 
in Figure 12. The ResvTear message 1200 includes a Common Header 802, an optional Integrity 
object 804, a Session object 806, an RSVPJlop object 808, and optional Admin_Status 816, a 
Style object 1006, and a flow descriptor 1200. 

[0091] A common format is employed for PBS JSender_TSpec object 828 and 
PBS_Flowspec object 1010. Each object includes a length field 1300, a Class-Num field 1302, a 
C-Type field 1304, object contents 1306, a reserved field 1308, and a bandwidth % field 1310. 
PBS_Send_TSpec objects 828 and PBS_Flowspec objects 1010 can be identified by their 
respective Class-num/C-Type values. The value in bandwidth % field 1310 represents the 
amount of bandwidth expressed by the intermediate node as a percent of the available bandwidth 
on a given lightpath segment. An intermediate node (i.e., a switching node) normalizes this 
percentage to the available bandwidth of its outgoing link. This enables each of the switching 
nodes to build-up its bandwidth allocation table for all the incoming label requests and determine 
if it can satisfy each bandwidth request. 

Peer Routing 

[0092] In a dynamic routing embodiment known as "peer-to-peer" (or simply "peer") 
routing, resource availability information is dynamically updated in response to resource 
reservations, enabling routing for an entire lightpath to be dynamically determined. During 
continuing operations, new reservations are made, consuming node and lightpath segment 
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resources. After a reservation for a lightpath has expired, the reserved resources for the lightpath 
are "released" and made available for new reservations. In one embodiment, a best lightpath 
route is selected at an ingress (i.e., source) node based on link state information in combination 
with performance metrics. A corresponding resource reservation message is generated 
containing information that explicitly defines the route. The resource reservation message is 
then routed to the various nodes along the route, and corresponding resources are reserved. 
Under another embodiment, the route may be dynamically determined using hop-by-hop route 
determination in a manner similar to IP routing. 

[0093] The network topology or link state information essentially consists of all the lightpath 
segments and nodes along with all of the associated parameters for these resources (i.e., link 
costs, resources information, etc. Under a modified Open Shortest-Path First (OSPF) protocol, 
each edge and switching node periodically broadcasts its link state information to all other edge 
and switching nodes, using the OOB network management channel. Information concerning 
changes in the network topology (e.g., new links added or existing links removed, node or link 
failure, etc.) is likewise broadcast to all edge and switching nodes. More specifically, in one 
embodiment link state information is interchanged using a modified control burst format that 
includes an Extended Header field in which the link state information is stored. 

[0094] Figure 14a shows a flow diagram illustrating an overview of the operations performed 
at the edge and switching nodes in support of peer routing, according to one embodiment of the 
invention. As depicted by the flowchart, the operations are performed in parallel. The top 
portion of the diagram concerns activities that are performed during network initialization and 
set up. In one embodiment, each edge and switching node builds a lightpath database 



30 



Attorney Docket 42P18108 

representing all the possible PBS lightpaths and node resources. The lightpath information is 
stored in a table containing routing data similar to that contained in IP network routers. In one 
embodiment, the lightpath database is built via message exchanges between the various nodes 
using an OOB network management channel. In one embodiment, the PBS network topology 
discovery is accomplished by running a modified OSPF IP routing protocol. The OSPF protocol 
is well-known in the art, and is commonly used for IP routing within an autonomous system 
(e.g., LAN or enterprise network). From the information contained in the messages, routing 
trees can be built and forwarded to other nodes, enabling each switching node to build its own 
routing tree table. These operations are depicted in blocks 1400 and 1402. In another 
embodiment, the routing tree tables may be manually generated. This scheme is generally more 
applicable to smaller networks. 

[0095] In one embodiment, each switching node builds a "fixed" routing tree (embodied as a 
table) defining the available routes to other nodes based on the network topology at the time the 
routing trees are built. For example, a routing tree 1500B illustrative of an exemplary set of 
routing data contained in a fixed topology routing tree table is shown in Figure 15. In this 
embodiment, each of switching nodes B, C, E, and F are modeled as respective routers R B , Rc, 
R E , and R F , while the edge nodes A and D are respectively labeled E A and E D . The routing 
tree 1500B graphically depicts the necessary hops from router R B (i.e., switching node B) to 
reach edge nodes E A and E D . The bottom node along each "branch", also known as a "leaf 
node, represents the destination, while the topmost node (R B in this instance) represents the 
"root" of the routing tree. In general, circular or circuitous routes are not to be included in the 
routing tree table, as such routing is very inefficient. For example, a route between switching 
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node C and edge node D spanning Rc-Re-Rb-Rf-Ed should not be included, even though it is a 
possible route between nodes C and D. Similar routing tables corresponding to routing 
trees 1500C, 1500E and 1500F are built at routers R c , Re, Rf, respectively. Techniques for 
building routing trees of this type are well-known in the networking art. 

[0096] As depicted by a block 1406, a primary initialization task performed at each edge 
node involved building its own routing tree table. In one embodiment, the switching node 
routing tree information is propagated via message exchanges to the edge nodes during the 
network topology discovery operations, as depicted in a block 1404. In general, the edge nodes 
may receive routing tree information from their adjacent switching nodes and advertise the 
available routes. In response to receiving the routing tree tables from its adjacent switching 
nodes, each edge node will build its own routing tree table in block 1406. 

[0097] In another embodiment, routing tree tables for the edge nodes are generated manually. 
This is performed in a manner similar to generating routing tree tables for the switching nodes. 
In one embodiment, routing tree tables are built by both the edge and switching nodes. In 
another embodiment, routing tree tables are manually generated for the edge nodes, while the 
switching nodes do not maintain any routing tree tables. 

[0098] In one embodiment, the routing tree tables for edge nodes A and D will contain data 
corresponding to routing trees 1600A and 1600D, as shown in Figure 16. In many respects, 
routing trees 1600A and 1600D are similar to the routing trees 1500B, 1500C, 1500E, and 1500F 
discussed above. The routing tree for a given edge node contains all routes from itself (as the 
source node) to the other edge nodes (the possible destination nodes). More specifically, the 
routing tree for each edge node will contain an aggregation of routing data comprising the routes 
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generated by its adjacent switching nodes with the addition of respective lightpath segments 
between the edge and each adjacent switching node. Thus, in the example of Figure 16, routing 
tree 1600A contains all routes from source node A to destination node D. Similarly, all routes 
from node D (as a source node) to node A (as a destination node) are stored in a routing 
tree 1600D. 

[0099] In addition to this basic routing information, routing tree 1600A further includes 
details of lightpath segments and, in one embodiment, wavelength information for those 
segments. For example, as shown in detail 1602 of the lightpath between routers Rb and R E , 
information is stored corresponding to lightpaths LP1 and LP2, as well as wavelengths X\. n for 
each lightpath. 

[00100] In general, the best route from each ingress node to a corresponding destination node 
will depend on the available network resources in consideration of performance criteria such as 
class-of-service (CoS), traffic loadings, etc. In one embodiment, route availability and 
performance criteria data are stored in the routing database. In one embodiment, an instance of a 
routing database is stored at each edge node. In one embodiment, respective database instances 
are maintained at the edge nodes, and a replication mechanism is employed such that the data in 
the database instances is replicated on a periodic basis. 

[00101] Exemplary routing tree data 1700A and 1700B illustrative of data that are stored in a 
routing tree table maintained at an edge node are shown in Figure 17. Typically, each row in a 
routing tree table will comprise a list of lightpath segments that, when concatenated, form a 
lightpath between the edge node and a destination node. Thus, the routing tree table will store a 
list of potential lightpaths that may be considered for transmitting data from a source node to a 
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destination node. In the embodiment shown in routing tree table 1700 A, lightpath routing 
information is defined at the lightpath segment level. Accordingly, only data signifying lightpath 
segments, such as a lightpath segment ID, needs to be stored in the table. 

[00102] As discussed above, WDM implementations support transmissions using multiple 
concurrent wavelengths for each fiber segment. In support of this case, entries in routing 
table 1700A may be expanded to include routing details at the lightpath segment/wavelength 
level. This is exemplified by entries shown in a routing table 1700B corresponding to lightpath 1 
of routing table data 1700A, wherein each of lightpath segments LP1, LP3, and LP5 support 
wavelengths X\, X 2 and A3. 

[00103] In one embodiment, the routes stored in a routing tree table are prioritized such that 
the lightpaths with the highest priority ratings are selected prior to lower priority lightpaths. For 
example, a prioritization algorithm may be used to prioritize lightpaths in the list as a function of 
one or more specific transmission-related criteria, such as single wavelengths first (i.e. 9 
lightpaths in which a single wavelength is used throughout the route) or as a function of the 
routing availability or traffic rules enforced at the time. The prioritization algorithm may also be 
used to reduce the size of the routing tree table, wherein lower priority routes are discarded or 
marked as unavailable. Furthermore, the prioritization of the potential lightpaths can be 
dynamically updated (Le. 9 reprioritized) if a change in network transmission conditions is 
detected, such as a change in network topology of if there is a need to balance the traffic loads 
across the network to achieve a desired performance. In other instances, the prioritization of the 
lightpaths may be determined based on observation of network behavior, e.g., through use 
statistical traffic data or employing a heuristic traffic prediction algorithm. Lightpath selection 
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techniques of this sort are well-known in the art, so no further explanation of how this operation 
is performed is included herein. 

[00104] After the edge nodes build their routing trees, a LINK_AVAILABILITY table is 
initialized in a block 1408. In one embodiment, routing tree data is stored in one table, while 
information pertaining to the availability of individual lightpaths or lightpath/wavelength 
combinations are stored in the LINK_AVAlLABILrrY table. For example, exemplary data 
corresponding to a LINK_ AVAILABILITY table 1800 is shown in Figure 18. Overall, the 
purpose of the routing tree table and LINK_AVAILABIL[TY table is to provide an edge node 
with information that will enable the edge node to select the best available route to schedule for 
subsequent data transmission for a selected time slot. As a result, this information may be 
combined into a single table, or may be stored in two or more linked tables. In one embodiment, 
the LINK_ AVAILABILITY table may be initialized based on data stored in the routing tree 
table. In another embodiment, link configuration data is broadcast to the edge nodes by the 
switching nodes, as depicted in a block 1410. The LINK_AVAILABILITY table may then be 
initialized based on the link configuration data received at each edge node. 

[00105] The operations and logic shown in the middle portion of Figure 14 concern ongoing 
or continuous network operations that may be started after the initialization phase is complete. 
Thus, the operations and logic shown are repeated continually during this phase. 

[00106] A typical transmission cycle begins in a block 1412, wherein a lightpath reservation 
process is initiated. In one embodiment, the reservation process begins by selecting a lightpath 
route. In the illustrated embodiment of Figures 17 and 18, this comprises a two-step process. 
First, a lightpath route is chosen based on prioritization rules. During this first step, the 
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wavelength availability of the selected lightpath route is not considered, so that all "active" 
lightpaths are available for selection, while "inactive" lightpaths are not available. For example, 
an inactive lightpath might correspond to an existing entry in the routing tree table that is not 
currently available due to a node or lightpath segment failure. 

[00107] Next, during the second step, the availability of each lightpath segment of the 
lightpath is considered by looking up resource consumption or availability information in 
LINK_AVAILABILITY table 1800. In one embodiment, this is an iterative process, wherein 
each segment along the lightpath is evaluated, one at a time. If resources for all segments are 
available, the lightpath selection is verified. If not, a new lightpath is selected, and the resource 
availability process is performed again to verify the availability of the selected lightpath. 

[00108] Typically, the LINK_AVAILABILTY table will include data that identifies an 
availability of individual fiber segments or segment/wavelength combination. In one 
embodiment, individual records contain data that is directly derived from resource reservation 
data that is broadcast to the edge nodes on a periodic basis. For example, information 
corresponding to the "incoming" or "outgoing" half of the reservation records stored in resource 
reservation table 2000 described below are stored in LINK_AVAILABILITY table 1800. 

[00109] The LINK_AVAILABILITY table 1800 includes a plurality of columns containing 
information relating to the availability of various lightpath segments during corresponding 
timeslots. In the illustrating embodiment, these include a Lightpath Segment ID column 1802, a 
Wavelength column 1804, a Start Time column 1806, and End Time column 1808, a 
Bandwidth % column 1801, and a Link Status column 1812. As will be seen below, in one 
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embodiment the data in LINK_AVAILABILITY table 1800 mirrors resource reservation data 
maintained at the various switching nodes in the network. 

[00110] The Lightpath Segment ED column 1802 contains the lightpath segment ID for a 
corresponding lightpath segment reservation. The Wavelength column 1804 contains the 
wavelength for the identified lightpath segment reservation. Although shown as a single column 
for illustrative purposes, wavelength information may be contained in two or more columns as 
well, depending on the wavelength translation scheme used. The values in Start Time and End 
Time columns 1806 and 1808 respectively correspond to the start and end times of a variable 
timeslot for which the lightpath segment reservation is made. The Bandwidth % column 1810 is 
used to store the percentage of bandwidth for the corresponding wavelength that is consumed by 
the reservation. In implementation in which fraction bandwidth allocations are not supported, 
their will be no Bandwidth % column. The value in Link Status column 1812 is a Boolean value 
that indicates whether the lightpath segment is available or not. This column facilitates a faster 
link availability lookup under conditions in which links may be down. A "1" indicates the link is 
active, while a "0 M indicates the link is inactive. 

[00111] The link availability data may be stored in a variety of different manners. In the 
embodiment illustrated in Figure 18, individual records contain data that is identical portions of 
corresponding records stored in the resource reservation tables of each of the switching nodes. 
Thus, in order to determine the availability of a particular resource for a particular time slot, all 
of the records containing an overlapping time slot and corresponding to the particular lightpath 
segment or lightpath segment/wavelength combination are retrieved from 
LINK_AVAILABILITY table 1800 and aggregated to determine a total bandwidth consumption 
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percentage. If the remaining (i.e., non-reserved) portion of the bandwidth is sufficient to support 
the to be reserved data transmission, then the process moves on to evaluated the availability of 
the next lightpath segment in the lightpath route. If not, an error message is generated to inform 
the edge node that it needs to select another route, as the selected route includes a resource that is 
not available. In another embodiment, the current bandwidth consumption or availability 
percentage or bandwidth amount is updated in response to receiving the link state data that is 
broadcast in block 1452 below. This process requires more overhead up front, but returns a 
faster result during a real-time query request for resource availability. 

[00112] Once a selected lightpath has been verified for resource availability, a resource 
reservation message (in the form of a Path message) is generated that includes an explicit 
definition of the route. For example, Figure 16 shows details of an exemplary Path 
message 1600 corresponding to a first pass of the resource reservation process in which an 
explicit lightpath route of LP1-LP3-LP5 is defined. The destination PBS address 818 contains 
the IP address of the destination node D, while source PBS address 820 contains the IP address 
of source node A. Since the most recent node to forward the message is the source node A, 
RS VP_Hop object 808P contains the IP address for node A 

[00113] Information specifying the lightpath segment and wavelength for, the each of the 
lightpath segments along the route are stored in label set 814 under generalized PBS label 
request object 812. Reservation information corresponding to the first hop in the route is 
contained in a label A-B-LPlXl. Each label includes information identifying an input fiber port 
for the receiving node (e.g., input fiber port 1 of switching node B), an input wavelength under 
which data signals will be transmitted on the fiber coupled to the input fiber port (195.6 THz) (it 
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is noted that the input wavelength is actually determined as a function of the values in input 
wavelength field 504 and A field 508, as discussed above - a specific value is used here for 
illustrative purposes), and the lightpath segment ID (e.g., LP1) for the lightpath coupled between 
the sending and receiving nodes. 

[00114] As discussed above, the reservations to be made comprise fine to coarse-grain time 
slot reservations corresponding to future scheduled uses of virtual network links comprising 
lightpaths made up of multiple concatenated lightpath segments. Accordingly, time period data 
corresponding to Time_Values object 810 comprising a start and end time for a corresponding 
reservation time slot are respectively stored in a start time object 8 10A and an end time 
object 810B. For illustrative purposes, the start time depicts 12:00:00 (i.e. 12 noon) and 
12:01:00; in an actual implementation, date information may be included as well, either in the 
same fields or additional fields. 

[00115] The explicit route information is contained in Explicit_Route object 811. In this 
instance, the Explicit_Route 811 contains a selected route of LPl-to-LP3-to-LP5. 

[00116] In accordance with another aspect of the invention, reservations for the use of 
lightpath segments used to make up a given lightpath may be defined such that only a partial 
amount of the channel bandwidth is used. As discussed above, information defining a 
bandwidth % for the reservation may be stored in bandwidth % field 1310 of sender descriptor 
object 824. Reservation for a resource request that consumes less than or equal to the total 
available bandwidth for a given resource are accepted, while requests that would consume 
unavailable bandwidth will be denied. 
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[00117] Once generated, the Path message is sent to the first node in the explicit route, which 
will comprise a switching node. The Path message is then propagated through the upstream 
nodes defined by the explicit route, while the operations of blocks 1414, 1416, and 1418 are 
preformed at each switching node. These operations begin in block 1414, wherein the resource 
availability is confirmed. The reason for this operation is that although resource availability was 
previously verified at the source edge node, the availability data stored at that edge node may not 
be completely up to date. For example, a resource may be consumed by a reservation that is 
made after the most recent broadcast of link state information by a given node. In one 
embodiment, resource availability confirmation may be performed by retrieving any resource 
reservation records for the resource (e.g., lightpath segment or lightpath segment/wavelength) 
and time slot and verifying that there is enough remaining bandwidth % to meet the reservation 
request. 

[00118] For example, a determination would be made to whether sufficient bandwidth 
percentage was available for the time slot based on a combination of fiber link (lightpath 
segment) and wavelength. Thus, a determination would be made in decision block 1408 to 
whether either of lightpath segments LP3 or LP4 could support the requested reservation. The 
availability information can be determined based on information contained in the resource 
reservation table for the current node, as described below. 

[00119] An indication of sufficient resources means that the specified resource {i.e., the 
bandwidth request at the wavelength for the lightpath segment received at the next hop node) has 
not been previously scheduled for use over any portion of the specified time slot. In one 
embodiment, this information may be determined based on resource reservation lookup tables 



40 



Attorney Docket 42P18108 

stored at each node, as exemplified by a resource reservation table 2000 shown in Figure 20. 
The resource reservation table contains data pertaining to "soft" (requested, but yet to be 
confirmed) and "hard" (confirmed) reservations for the various transmission resources provided 
by the node. Reservation table 2000 includes a plurality of columns in which data are stored in 
rows, wherein data for a given row is called a "record," and the columns for a given record 
comprise data fields. The columns include an optional Key column 2002, an Input Fiber Port 
column 2004, an Input Wavelength column 2006, an optional Input Lightpath Segment ID 
column 2008, an Output Fiber Port column 2010, and Output Wavelength column 2012, an 
optional Output Lightpath Segment ID column 2014, a Start Time column 2016, and End Time 
column 2018, a Bandwidth % column 2020, and a Status column 2022. 

[00120] In general, Key column 2002 is used to store a unique identifier (key) for each record, 
enabling quick retrieval of records and guaranteeing record uniqueness. In one embodiment, the 
key contains information corresponding to the session object 806 of the Path message. In 
another embodiment, the key is derived from a combination of data in fields corresponding to the 
PBS label (i.e., in input fiber port column 2004, input wavelength column 2006, and lightpath 
segment ID column 2008). This enables quick lookup of reservation entries in response to 
processing control bursts containing specific PBS resource allocation requests. 

[00121] Data relating to incoming link parameters are stored in Input Fiber Port column 2004, 
Input Wavelength column 2006, and, optionally, Input Lightpath Segment ID column 2008, 
while data relating to outgoing link parameters are stored in Output Fiber Port column 2010, 
Output Wavelength column 2012, and, optionally, Output Lightpath Segment ID column 2014. 
Each switching node is coupled to two or more fiber links via respective I/O ports. For example, 
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the exemplary data in reservation table 2000 corresponds to switching node B, which includes 
six network input/output (I/O) ports, depicted as encircled numbers 1-6 in Figure 15. The value 
in Input Fiber Port column 2004 identifies the I/O port at which data is received by the particular 
switching node maintaining the reservation table, while the value in Output Fiber Port 
column 2010 identifies the I/O port via which data is transmitted. As an option, input and output 
fiber routing data may be stored through reference to input and output lightpath segments, in lieu 
of or in addition to specifying I/O ports. Accordingly, in one embodiment, data identifying the 
input and output lightpath segments is stored in Input Lightpath Segment ED column 2008 and 
Output Lightpath Segment ID column 2014, respectively. 

[00122] As discussed above, a given lightpath segment may support concurrent data streams 
that are transmitted using different wavelengths. Accordingly, data in Input Wavelength 
column 2006 is used to identify the wavelength incoming data is to be transmitted at for a given 
reservation record, while data in Output Wavelength column 2012 is used to identify the 
wavelength outgoing data is to be transmitted at. 

[00123] Routing paths through each switching node are reserved for a respective time slot of 
variable duration based on appropriate information contained in the reservation request message. 
Typically, the time slot will be delineated by a start time and an end time, with corresponding 
data stored in Start Time column 2016 and End time column 2018. Typically, a resource 
reservation request will reserve a resource time slot for a variable amount of time from sub- 
seconds to days. For simplicity, the time data shown Start Time column 2016 and End Time 
column 2018 only reflect the second level. 
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[00124] The bandwidth % for the request, as well as previously allocated bandwidth %'s, are 
stored in bandwidth % column 2020. Status bits identifying unconfirmed (0) and confirmed (1) 
reservations are stored in reservation status column 2022. 

[00125] In one aspect, resource availability is determined based on the bandwidth availability 
for the requested lightpath segment, input wavelength, and time period. It is noted that any entry 
with a time period overlapping the requested time period and having similar parameters to the 
requested resource is considered, including reservations for both incoming and outgoing traffic. 
The bandwidth percent of the entries is aggregated, along with the requested bandwidth. If the 
sum of the bandwidth exceeds a selected threshold value (e.g., 100%) within the same start and 
end times, there are inadequate resources to satisfy the request. 

[00126] Returning to Figure 14a, in a decision block 1416, a determination is made to whether 
the resource is available. If it is, the logic moves forward to block 1418, wherein the Path 
message is forwarded to the next node identified by the explicit route. In connection with this 
operation, the IP address information contained in RSVP Hop object 808P is updated to reflect 
the new previous hop node (e.g., node B in this example). If the resource is unavailable, the 
reservation cannot be completed. In response, in one embodiment a PathErr message is 
propagated back or broadcast to the source edge node. Upon receiving the message, the source 
edge node selects a new route, and the previous route reservation process is repeated, beginning 
at block 1412. In another embodiment, each switching node is configured to maintain its own 
routing tree table. In this embodiment, the Path message may be rerouted by the switching node 
by updating the explicit route defined by Label_Set 812. In one embodiment, the entirety of the 
remainder of the lightpath route may be specified by the switching node. In another 
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emobidment, the remainder of the route is determined on a hop-by-hop basis, wherein the 
determination of the next hop is made as the reservation message is passed through various 
switching nodes along a (now) inexplicit route. 

[00127] Once the Path message has been forwarded to the destination, operations are 
performed to confirm the soft reservations at each node by propagating a reservation 
acknowledge (Resv) message downstream along the reserved lightpath route. These operations 
are depicted by a block 1420. 

[00128] Further details of the operations of block 1420 are shown in Figure 14b, which 
represents the upstream portion of the reservation request. At this point the current node is the 
destination node D, as depicted by a start block 1421. As before, operations are repeated for 
each of the nodes along the selected lightpath, akin to a back-propagation technique; these 
operations are delineated by start and end loop blocks 1423 and 1424. The operations are 
performed at each node, in reverse sequence to the downstream traversal of the lightpath using a 
Resv message that is created in a block 1422. 

[00129] An exemplary Resv message 2100 corresponding to the current state is shown in 
Figure 21. Many of the objects contained in Resv message 2100 contain similar values to like- 
numbered objects contained in Path message 1900, including Session object 806, and start and 
end time objects 8 10A and 810B. As discussed above, the Resv message contains a flow 
descriptor 1008 that includes a PBS_Flowspec 1010, a Filter_Spec 1012, and a 
Generalized_PBS_Label 1014. In a similar manner to PBS_Sender_TSpec 828 of Path 
message 1600, PBS_Flowspec 1010 includes a filter field 1310 value of 30%. Also, the 
Generalized_PBS_Label 1014 will have a form similar to generalized PBS label 500 discussed 
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above. In this instance, the PBS label C-D-0 corresponding to lightpath segment LP5 comprises 
•i 

the first form of the embedded label. 

[00130] After the initial Resv message is created, the logic proceeds to the looping operation 
delineated by start and end loop blocks 1423 and 1424. The first operation in the loop occurs in 
a block 1426, wherein the software reservation for the current node is upgraded to a hard 
reservation, and the corresponding resources are committed. This is reflected by changing the 
value in reservation status column 2022 from a "0" (soft, i.e., unconfirmed) to a "1" (hard, i.e., 
confirmed, meaning the resources are committed). 

[00131] Following the operation of block 1426, a determination is made to whether the source 
node has been reached in a decision block 1428. If it has, the process is completed, and all 
segments on the lightpath are reserved for a subsequent scheduled use. If not, the process 
proceeds to a block 1430 in which the Resv message and PBS label are updated for the next 
lightpath segment. The process then repeats itself for the next (now current) switching node until 
the source node is reached. At this point, all the nodes along the lightpath will have hard {i.e., 
confirmed) reservations, and the entire lightpath will be scheduled for use during the indicated 
time slot contained in the reservation table, as depicted by a block 1432. 

[00132] At this point, the reservation of the lightpath is confirmed, and the switching nodes 
are configured to switch optical signals passing through them during the scheduled time slot. 
Accordingly, data traffic (i.e., one or more data bursts) may then be sent across the lightpath, 
which begins by sending the data traffic from the source edge node to the first switching node in 
a block 1434. The data is routed along the rest of the lightpath based on the scheduled switching 
configuration of the other nodes along the lightpath, as depicted by a block 1436. 
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[00133] An additional set of continuous operations corresponding to link state updates are 
shown at the bottom of Figure 14 are also performed. In general, these operations are performed 
asynchronous to those in the continuous section discussed above, although synchronous 
operations may also be implemented. The purpose of the link state update operation is to apprise 
each edge node of the present availability of each switching node resource. The link sate data is 
generated in a block 1450, and broadcast to the edge nodes in a block 1452. As discussed above, 
in one embodiment this comprises broadcasting resource reservation data on a periodic basis to 
the edge nodes. In another embodiment, resource reservation data for individual reservations 
may be broadcast to the edge nodes in response to reservation confirmation at the switching 
nodes (real-time link state updating). In general, the link state data may be broadcast over an 
OOB management channel or an in-band channel. The advantage of periodic broadcast is that it 
reduces overhead and management channel bandwidth consumption compared with real-time 
link state updating. The advantage of real-time link state updating is that the link availability 
information stored at the edge nodes is more up to date, substantially eliminating the likelihood 
that a resource along a selected and verified lightpath will be unavailable. 

[00134] In response to receiving the updated link state data, the LINK_AVAILABILITY 
tables (or equivalent availability data) are updated at each of the edge nodes in a block 1454. 
The ideal here is to have the link availability data reflect the current link states of the network for 
given time slots, such that the lightpath that are verified as available at the edge nodes are, in 
fact, actually available when the lightpath reservation message is sent. 

[00135] A simplified block diagram 2200 of a PBS edge node architecture in accordance 
with one embodiment is shown in Figure 22. The architecture components include a 
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processor 2202, which is coupled in communication with each of a memory 2204, 
firmware 2206, optional non-volatile storage 2208, an external network interface 2210, and a 
PBS network interface 2212. The external network interface provides functionality for 
interfacing with an external network, such as a LAN, WAN, or another PBS network. PBS 
network interface 2212 provides functionality for interfacing with the internal infrastructure 
within a PBS network. The PBS network interface will generally be coupled to one or more 
fiber links, labeled as input/output fibers in Figure 22 to illustrate that the interface can support 
both input and output data transmission. 

[00136] The burst assembly and framing, burst scheduling and control, which are part of 
the PBS MAC layer and related tasks, are performed by processor 2202 via execution of 
instructions comprising a PBS module 2214, which is loaded into memory 2204 for execution. 
In one embodiment, processor 2202 comprises a network processor. Network processors are 
very powerful processors with flexible micro-architecture that are suitable to support wide-range 
of packet processing tasks, including classification, metering, policing, congestion avoidance, 
and traffic scheduling. For example, the Intel® IXP2800 NP, which has 16 microengines, can 
support the execution of up to 1493 microengines instructions per packet at packet rate of 15 
million packets per second for 10 GbE and a clock rate of 1.4 GHz. 

[00137] Functionality for performing operations corresponding to the flowcharts of 
Figures 14a, 14b, and the edge node side of the flow diagram of Figure 20 may be implemented 
via execution of firmware and/or software instructions on processors provided by the edge nodes. 
The instructions for performing these operations are collectively depicted as a router 
module 2216. Execution of the router module 2216 enables a PBS edge node to perform the 
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various routing operations discussed herein, including building and updating a 
ROUTING_TREE table 2218 and LINK_AVAILABILITY 1800. In general, the instructions 
corresponding to router module 2216 and PBS module 2214 may be stored in non-volatile 
storage 2208 or on a disk-based storage device, such as a disk drive (not shown). 

[00138] Embodiments of method and apparatus for implementing a resource reservation 
schedules in a photonic burst switching network are described herein. In the above description, 
numerous specific details are set forth to provide a thorough understanding of embodiments of 
the invention. One skilled in the relevant art will recognize, however, that embodiments of the 
invention can be practiced without one or more of the specific details, or with other methods, 
components, materials, etc. In other instances, well-known structures, materials, or operations 
are not shown or described in detail to avoid obscuring this description. 

[00139] Reference throughout this specification to "one embodiment" or "an embodiment" 
means that a particular feature, structure, or characteristic described in connection with the 
embodiment is included in at least one embodiment of the present invention. Thus, the 
appearances of the phrases "in one embodiment" or "in an embodiment" in various places 
throughout this specification are not necessarily all referring to the same embodiment. 
Furthermore, the particular features, structures, or characteristics may be combined in any 
suitable optical manner in one or more embodiments. 

[00140] Thus, embodiments of this invention may be used as or to support software program 
executed upon some form of processing core (such as the CPU of a computer or a processor of a 
module) or otherwise implemented or realized upon or within a machine-readable medium. A 
machine-readable medium includes any mechanism for storing or transmitting information in a 
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form readable by a machine (e.g., a computer). For example, a machine-readable medium can 
include such as a read only memory (ROM); a random access memory (RAM); a magnetic disk 
storage media; an optical storage media; and a flash memory device, etc. In addition, a machine- 
readable medium can include propagated signals such as electrical, optical, acoustical or other 
form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.). 

[00141] In the foregoing specification, embodiments of the invention have been described. It 
will, however, be evident that various modifications and changes may be made thereto without 
departing from the broader spirit and scope as set forth in the appended claims. The 
specification and drawings are, accordingly, to be regarded in an illustrative rather than a 
restrictive sense. 
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